Credit Scoring using Semiparametric Methods
نویسنده
چکیده
Credit scoring methods aim to assess credit worthiness of potential borrowers to keep the risk of credit loss low and to minimize the costs of failure over risk groups. Standard parametric approaches as logistic discrimination analysis assume that the probability of belonging to the group of ”bad” clients is given by P (Y = 1|X) = F (βX), with Y = 1 indicating a ”bad” client and X denoting the vector of explanatory variables. We consider a semiparametric approach here, that generalizes the linear argument in the probability P (Y = 1|X) to a partial linear argument. This model is a special case of the Generalized Partial Linear Model E(Y |X, T ) = G{βX + m(T )} (GPLM) which allows to model the influence of a part T of the explanatory variables in a nonparametric way. Here, G(•) is a known function, β is an unknown parameter vector, andm(•) is an unknown function. The parametric component β and the nonparametric function m(•) can be estimated by the quasilikelihood method proposed in Severini & Staniswalis (1994). We apply the GPLM estimator mainly as an exploratory tool in a practical credit scoring situation. Credit scoring data usually provide various discrete and continuous explanatory variables which makes the application of a GPLM interesting here. We estimate and compare different variations of the semiparametric model in order to see how the several explanatory variables influence credit worthiness. In contrast to more general nonparametric approaches, the estimated GPLM models allow an easy visualization and interpretation of the results. The estimated curves indicate in which direction the logistic discriminant should be improved to obtain a better separation of ”good” and ”bad” clients.
منابع مشابه
Credit scoring in banks and financial institutions via data mining techniques: A literature review
This paper presents a comprehensive review of the works done, during the 2000–2012, in the application of data mining techniques in Credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...
متن کاملFuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring
There are many methods introduced to solve the credit scoring problem such as support vector machines, neural networks and rule based classifiers. Rule bases are more favourite in credit decision making because of their ability to explicitly distinguish between good and bad applicants.In this paper multi-objective particle swarm is applied to optimize fuzzy apriori rule base in credit scoring. ...
متن کاملFuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring
There are many methods introduced to solve the credit scoring problem such as support vector machines, neural networks and rule based classifiers. Rule bases are more favourite in credit decision making because of their ability to explicitly distinguish between good and bad applicants.In this paper multi-objective particle swarm is applied to optimize fuzzy apriori rule base in credit scoring. ...
متن کاملImproved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring
In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...
متن کاملA case study on using generalized additive models to fit credit rating scores
We consider the estimation of credit scores by means of semiparametric logit models. In credit scoring, the fitted rating score shall not only provide an optimal classification result but serves also as a modular component of a (typically quite complex) rating system. This means in particular that a rating score should be given by a linearly weighted sum of rating factors. That way the rating p...
متن کامل